Which of the following techniques can be used for the purpose of keyword normalization, the process of converting a keyword into its base form?
1 and 2
2 and 4
1 and 3
1, 2 and 3
Which are python libraries used in NLP?
pandas
spacy
nltk
All of above
What is the name for the space inside which a robot unit operates?
danger zone
spatial base
work envelop
environment
What are the possible features of a text corpus Count of word in a document Boolean feature – presence of word in a document. Vector notation of word Part of Speech Tag Basic Dependency Grammar Entire document as a feature
12345
1234
123
12
Which step is the process of breaking down documents into smaller units of analysis?
Ngrams
Tokenization
Stopwords
Corpus
Which is a model of measuring the incidence of known words?
A low weight in TF-IDF
A high weight in TF-IDF
A corpus
A bag of words
Tick what is true about WordNet from the following sentences.
A machine-readable thesaurus, and aspects of a dictionary
A hierarchically organized lexical database
Is a lexical database of semantic relations between words
From the sentence “Fintech Online Course”, how many bigrams can be created?
2
3
4
1
Which function would you use to retrieve the list of unique words?
fit_tranform()
CountVectorizer()
get_feature_names()
download()
Morphotacticsis a model of____________
How and which morphemes can be affixed to a stem
Spelling modifications that may occur during affixation
N-grams of affixes and stems
All affixes in the English language
Which is a high term frequency and low document frequency?
N-grams are defined as the combination of N keywords together. How many bi-grams can be generated from a given sentence:
“Analytics Vidhya is a great source to learn data science”
7
What were the objectives of NLP day 1?
Implementing ngrams and word clouds
Tokenizing texts into sentences and words
Implement lemmatization and stop wording
A vader compound score of 1.02 evaluates to__________
positive sentiment
neutral sentiment
All of the above
None of mentioned above
Which are common words usually removed in an NLP analysis?
Which is the main Python package we use for NLP?
NLTK
NLP-LIB
Scikit-Learn
PyNLP
Which of the following Affective States does Sentiment Analysis mostly focus on?
Personality Traits
Mood
Emotion
Attitudes
Difficulties/Challenges in Word Sense Disambiguation (WSD) .Tick which is (FALSE) from the statements given below.
Completely different algorithm might be needed for different applications
to decide the sense of the word because different senses can be very closely related
The problem of Inter-judge variance as the WSD systems are generally tested by having their results on a task compared against the task of human beings
Words can be easily divided into discrete sub-meanings
How do we get from NLP text analysis to stock price correlation?
Convert parts of speech to categorical variables
Transform some NLP results into features
Recognize some named entities
Which is the process of turning different morphologies (i.e. versions) of a word into its base form?
Lemmatization<br>
Which are included in named entity recognition?
Currency
Time and dates
Nouns
All of above<br>
Which is a collection of documents?
How many trigrams phrases can be generated from the following sentence, after performing following text cleaning steps:
“#Analytics-vidhya is a great source to learn @data_science.”
What does spaCy tagging do?
Identifies word order relationships
Identifies more frequent words
Identifies importance and relevance
Identifies parts of speech
Which function would you use to implement a bag of words by creating a matrix of token counts?
. ____________ is a Python library to make programs that work with natural language.
Pandas
BeautifulSoup
Seaborn
Tick whichever is an application Named Entity Recognition (NER)
Efficient Search Algorithms
Analysis the rude behavior from customer feedback
Classifying content for NEWS providers
What does NLP stand for? Write what the acronym means.
Netware Lite Protocol
Natural Language Processing<br>
Nationwide Loan Processing
None of these
In a word cloud, what does the size correspond to?
Importance
Frequency<br>
Length
Relation
N-grams are defined as the combination of N keywords together. How many bi-grams can be generated from given sentence:“NPTEL videos are a great source to learn engineering courses”
6
9
8
What are the names of lecturers in this NLP part?
Sebastian Poliak
Stefan Kosztolanyi
None of above
Why we use named entity recognition in NLP?
Breaking sentences into words
Classify entities into predefined labels
Creating a set of vocabularies
None
Which of the following sense for the word “language” is not available in wordnet? i. a systematic means of communicating by the use of sounds or conventional symbol ii. communication by word of mouth iii. the cognitive processes involved in producing and understanding linguistic communication iv. the style of a piece of writing or speech v. the mental faculty or power of vocal communication
keeps word order, keeps word multiplicity
keeps word order, disregards word multiplicity
disregards word order, disregards word multiplicity
disregards word order, keeps word multiplicity
Which are multiple word sequences?
What is the field of Natural Language Processing (NLP)?
Artificial Intelligence
Linguistics
Computer Science
All of the mentioned
Which company’s tone analyzer service did we discuss?
Google
Apple
Amazon
IBM
Which news sources did we use?
Reuters
News API
What were the objectives of NLP day 2?
Use NLTK and VADER to classify news as positive, negative, or neutral
Perform data preparation techniques for sentiment analysis
Analyze sentiments and tone from news feeds
. Machine Translation is that convert_____________
Any human language to English
Human language to machine language
One human language to another
Machine language to human language
What kind of charts are used in visualizations of results?
Scatterplot
Barchart
Windowplot
Both A & B
0 Comments